1,239 research outputs found

    Effective techniques for Indonesian text retrieval

    Get PDF
    The Web is a vast repository of data, and information on almost any subject can be found with the aid of search engines. Although the Web is international, the majority of research on finding of information has a focus on languages such as English and Chinese. In this thesis, we investigate information retrieval techniques for Indonesian. Although Indonesia is the fourth most populous country in the world, little attention has been given to search of Indonesian documents. Stemming is the process of reducing morphological variants of a word to a common stem form. Previous research has shown that stemming is language-dependent. Although several stemming algorithms have been proposed for Indonesian, there is no consensus on which gives better performance. We empirically explore these algorithms, showing that even the best algorithm still has scope for improvement. We propose novel extensions to this algorithm and develop a new Indonesian stemmer, and show that these can improve stemming correctness by up to three percentage points; our approach makes less than one error in thirty-eight words. We propose a range of techniques to enhance the performance of Indonesian information retrieval. These techniques include: stopping; sub-word tokenisation; and identification of proper nouns; and modifications to existing similarity functions. Our experiments show that many of these techniques can increase retrieval performance, with the highest increase achieved when we use grams of size five to tokenise words. We also present an effective method for identifying the language of a document; this allows various information retrieval techniques to be applied selectively depending on the language of target documents. We also address the problem of automatic creation of parallel corpora --- collections of documents that are the direct translations of each other --- which are essential for cross-lingual information retrieval tasks. Well-curated parallel corpora are rare, and for many languages, such as Indonesian, do not exist at all. We describe algorithms that we have developed to automatically identify parallel documents for Indonesian and English. Unlike most current approaches, which consider only the context and structure of the documents, our approach is based on the document content itself. Our algorithms do not make any prior assumptions about the documents, and are based on the Needleman-Wunsch algorithm for global alignment of protein sequences. Our approach works well in identifying Indonesian-English parallel documents, especially when no translation is performed. It can increase the separation value, a measure to discriminate good matches of parallel documents from bad matches, by approximately ten percentage points. We also investigate the applicability of our identification algorithms for other languages that use the Latin alphabet. Our experiments show that, with minor modifications, our alignment methods are effective for English-French, English-German, and French-German corpora, especially when the documents are not translated. Our technique can increase the separation value for the European corpus by up to twenty-eight percentage points. Together, these results provide a substantial advance in understanding techniques that can be applied for effective Indonesian text retrieval

    The Pattern Speed of the Galactic Bar

    Get PDF
    Most late-type stars in the solar neighborhood have velocities similar to the local standard of rest (LSR), but there is a clearly separated secondary component corresponding to a slower rotation and a mean outward motion. Detailed simulations of the response of a stellar disk to a central bar show that such a bi-modality is expected from outer-Lindblad resonant scattering. When constraining the run of the rotation curve by the proper motion of Sgr A* and the terminal gas velocities, the value observed for the rotation velocity separating the two components results in a value of (53+/-3)km/s/kpc for the pattern speed of the bar, only weakly dependent on the precise values for Ro and bar angle phi.Comment: 5 pages LaTeX, 2 Figs, accepted for publication in ApJ Letter

    Composition and Characterization of Refined Oil Compared with Its Crude Oil from Waste Obtained from Mangifera indica

    Get PDF
    ABSTRACT: Crude and refined seed kernel oil was extracted from Alphanso mango aiming the study of physico-chemical properties. The oil produced was refined through degumming, neutralization and bleaching process using local adsorbent (activated clay). The specific gravity, refractive index, acid value, saponification value and iodine value for both crude and refined mango seed kernel oil was determined. The solid form refined mango kernel fat resembles cocoa butter when compared with physical and chemical characteristics. They both are thus, used as a substitute for each other or just as extenders. Mango Fat can be used as edible oil apart from its general uses in manufacturing soap, cosmetic formulations and industrial applications

    A note on multiplicative sum Zagreb index

    Get PDF
    Abstract For a nontrivial (molecular) graph G, its multiplicative sum Zagreb index, denoted by π * 1 (G), is defined as the product of the sum is the degree of vertex u. In this note, we establish a relationship between π * 1 (G) of a graph and the first multiplicative Zagreb index of its total graph. Moreover, we present some bounds for π * 1 (G) in terms of some other graph parameters including the second multiplicative Zagreb index, radius, the first Zagreb index

    A note on generalized absolute CesĂ ro summability

    Get PDF
    Abstract In the present paper , we give several improvements to the result of [2] concerning absolute CesĂ ro summablity of infinite series

    Representation of people of South Asian origin in cardiovascular outcome trials of glucose-lowering therapies in Type 2 diabetes

    Get PDF
    Aims : Our aim was to investigate the proportional representation of people of South Asian origin in cardiovascular outcome trials of glucose-lowering drugs or strategies in Type 2 diabetes, noting that these are among the most significant pieces of evidence used to formulate the guidelines on which clinical practice is largely based. Methods : We searched for cardiovascular outcome trials in Type 2 diabetes published before January 2015, and extracted data on the ethnicity of participants. These were compared against expected values for proportional representation of South Asian individuals, based on population data from the USA, from the UK, and globally. Results : Twelve studies met our inclusion criteria and, of these, eight presented a sufficiently detailed breakdown of participant ethnicity to permit numerical analysis. In general, people of South Asian origin were found to be under-represented in trials compared with UK and global expectations and over-represented compared with US expectations. Among the eight trials for which South Asian representation could be reliably estimated, seven under-represented this group relative to the 11.2% of the UK diabetes population estimated to be South Asian, with the representation in these trials ranging from 0.0% to 10.0%. Conclusions : Clinicians should exercise caution when generalizing the results of trials to their own practice, with regard to the ethnicity of individuals. Efforts should be made to improve reporting of ethnicity and improve diversity in trial recruitment, although we acknowledge that there are challenges that must be overcome to make this a reality

    Higher education in Indonesia: Contemporary challenges in governance, access, and quality

    Get PDF
    This chapter presents the development of Indonesian higher education since its origins to current challenges in the fields of governance, autonomy, access, equity, quality, and internationalization. Indonesia has a massive and diversified tertiary education, including experiments in community colleges and online programs. The higher educational system remains mainly centralized, with the exception of some reforms towards financial autonomy. Insufficient public funding hinders the capacity to provide adequate teaching, research, and facilities among other aspects. The consequential rise in student fees contributes to an overrepresentation of students from Java, urban centers, and higher social classes

    Cold chains in Hanoi and Bangkok:changing systems of provision and practice

    Get PDF
    We know that patterns of domestic consumption are situated within broader systems of provision and that home appliances like the fridge freezer bridge between practices of cooking, shopping and eating, on the one hand, and increasingly global systems of food production, distribution and diet on the other. In analysing uses of fridge freezers in Hanoi and Bangkok as expressions, in microcosm, of complex and evolving processes of urbanisation and food provisioning this paper provides new insight into how specific configurations, dependencies and patterns of consumption take hold and how they vary and change. Our analysis of systems and practices in flux has the dual function of showing how household strategies reflect and contribute to more extensive transformations, and of demonstrating how these are shaped by ongoing tensions and relations between new and established forms of urban food supply and associated concepts of freshness and safety. The result is a subtle account of the multiple routes through which consumer ‘needs’ evolve

    Symptomatology and health attitudes of chronic hepatitis B patients in the USA

    Get PDF
    This study was conducted to understand the symptomatology, attitudes, and behaviours of chronic hepatitis B (CHB) patients in the USA. CHB patients enrolled in this study were recruited through multiple methods, including newspaper advertisements. Interviews were conducted in multiple languages, and all participants had a history of CHB infection for at least 6 months. Patients with documented human immunodeficiency virus or hepatitis C virus coinfection were excluded from data analyses, resulting in a total study population of 258 respondents who completed interviews between April and June 2004. The majority of monoinfected patients were male (57%) and non-Asian (92%, including 52% Caucasian, 32% African American and others). Length of diagnosis was 5.8 years for all participants (9.1-year Asian and 5.1-year non-Asian). Ninety-five per cent of CHB patients reported symptoms of differing severity in the 12 months prior to the survey. The most common symptoms included fatigue/loss of energy (90%) and loss of appetite (79%). Non-Asian patients described greater symptomatology, and were more likely than Asians to consider CHB an overriding concern in their daily activities. Patients were treated either currently or previously with interferon (IFN) described greater symptomatology than those treated without IFN. Survey results indicate that CHB patients may have greater symptomatology than recognized. Disease perceptions and treatment attitudes differ between Asian and non-Asian ethnic groups, with the former appearing to be more accepting and less concerned about the disease. Additional research about CHB symptomatology and health attitudes by ethnicity is needed to ensure that individuals with CHB are educated on the potential health risks and the availability of current treatment options
    • 

    corecore